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ggregation of corporate social performance (CSP) metrics poses a major challenge to researchers and practitioners. This 

study provides a critical evaluation of current aggregation approaches and proposes a new methodology based on data 
envelopment analysis (DEA) to compute a CSP index. DEA is independent of subjective weight specifications and provides 
an efficiency index to benchmark the CSP of firms. Using CSP data from 2190 firms in three major industries from the Kinder, 
Lydenberg, and Domini, Inc. database in 2007, our study presents the first application of the DEA model for CSP and ordinal 
data and opens up a new path for future empirical CSP research. 
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1. Introduction 


Stakeholders are becoming more and more concerned 
about the corporate social performance (CSP) of firms’ 
operations. CSP can be defined as “a construct that 
emphasizes a company’s responsibilities to multiple 
stakeholders, such as employees and the community 
at large, in addition to its traditional responsibilities to 
economic shareholders” (Turban and Greening 1996, 
p. 658). For example, investors are increasingly using 
socially responsible investing (SRI) screens to select or 
avoid investing in firms according to their environ- 
mental and social preferences (Chatterji et al. 2009, 
Delmas and Doctori-Blass 2010). Similarly, a growing 
number of consumers purchase eco-labeled products 
that signal a lower environmental and social impact of 
corporate operations (Loureiro and Lotade 2005). 
Some corporations are also developing socially re- 
sponsible purchasing practices to promote more 
sustainable supply chains (Atasu et al. 2008, Bowen 
et al. 2001, Carter 2008, Delmas and Montiel 2009, 
Drumwright 1994, Seuring and Müller 2008, Srivast- 
ava 2007). However, measuring CSP has proven to be 
a daunting task because it represents a broad range of 
economic, social, and environmental impacts caused 
by business operations and thus requires multiple 
metrics to fully cover its scope (Gond and Crane 2010, 
Rowley and Berman 2000). 

As a result, researchers often need aggregate CSP 
indicators to assess the overall CSP of firms. Most 
empirical studies on CSP use simple linear aggre- 
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gations, weighted or non-weighted, to derive a com- 
posite CSP score from a selection of CSP metrics. 
These types of approaches would seem appropriate 
when the weights are exogenously given. For exam- 
ple, NGOs may have a specific weighting scheme 
based on the priorities of their members. However, 
for managers who face a variety of stakeholder pres- 
sures, the choice of weights is more ambiguous. 
Specifically, one primary stakeholder group (e.g., 
customers) may very well hold opinions that conflict 
with those of another primary or secondary group 
(e.g., employees) about the same corporate social 
policy of a firm (Clarkson 1995, Delmas and Toffel 
2008). In addition, because stakeholder characteris- 
tics and preferences can shift dramatically under 
different contexts and times (Griffin 2000), prioritiz- 
ing CSP issues—such as climate change, employee 
relations, and human rights—can turn into a formi- 
dable task. 

Furthermore, CSP assessment contains both nega- 
tive and positive indicators to represent strengths 
and concerns regarding CSP practices. For example, 
generously giving to charities in the community is 
often perceived as a positive practice, whereas invest- 
ments that would lead to controversies might be 
considered detrimental to CSP. Similarly, the use of 
clean energy is often considered a positive practice, 
whereas making profits from fossil fuel products 
might be considered negative because of the impact 
on climate change. When stakeholders want to 
balance concerns over strengths, they also face the 
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challenge of assessing the respective importance of 
different CSP issues. 

Considering the multiple dimensions of the CSP 
construct, we argue the existing CSP aggregation 
methodologies fail to provide an effective measure of 
CSP. We show that the scores resulting from these 
aggregation methodologies differ in terms of their 
median and variance and are sensitive to changes in 
aggregation weights. This sensitivity can be fairly 
problematic. As expressing CSP through an aggregate 
measure is necessary for most analyses, we propose 
an alternative methodology to calculate a CSP index. 
Our methodology is based on data envelopment anal- 
ysis (DEA), a mathematical programming method for 
evaluating the relative efficiencies of firms (Charnes 
et al. 1978, Cook and Zhu 2006) that does not require 
a priori weights to aggregate different CSP issues. 

DEA computes an efficient frontier that represents 
the best performers in a peer group. The DEA CSP 
score represents the distance of a firm to the efficient 
frontier and the extent to which a firm can reduce its 
current concerns, given its strengths relative to those 
of the best performers. We argue that DEA has several 
advantages in addressing the challenges of assessing 
CSP. First, DEA produces a ratio index that incorpo- 
rates both good and bad CSP metrics. Second, DEA 
does not require an a priori weight specification for 
different CSP indicators. Third, the DEA score repre- 
sents the distance to the efficient frontier and is easy 
to interpret. These features help compare firms’ CSP 
both within and across industries. 

To meet the ordinal nature of the CSP data, we use 
the DEA model for rank order data (Cook and Zhu 
2006) and present the first large-scale empirical ap- 
plication of DEA to ordinal data. Our model is 
inspired by the study conducted by Bendheim et al. 
(1998), who used DEA to assess best management 
practices regarding stakeholder relations. However, 
their study did not consider the trade-offs between 
strengths and concerns. Our model can be contrasted 
with previous eco-efficiency studies based on DEA 
(e.g., Dyckhoff and Allen 2001, Fare et al. 2006, Ku- 
osmanen and Kortelainen 2007), which draw on 
concrete quantities of environmental data such as 
total CO, and SO, emissions. 

In this paper, we focus on the Kinder, Lydenberg, 
and Domini, Inc. (KLD) database, currently the most 
widely used and comprehensive information source 
for CSP research (Waddock 2003). KLD publishes the 
CSP ratings of major publicly traded companies in the 
United States, and the data cover areas of environmen- 
tal performance, social contribution, corporate governance, 
and controversial business involvement. Our empirical 
analysis shows that DEA is more robust than the 
existing CSP aggregation methodologies, whose ag- 
gregation results are sensitive to weight changes. Our 


analysis also highlights the ease of interpretation of 
the DEA score for benchmarking purposes. 

In the next section, we review the empirical CSP 
literature and outline the advantages of DEA and its 
formulations. In section 3, we compute the DEA effi- 
ciency score for CSP using KLD data and compare our 
results with those using prior existing aggregation 
methodologies. In section 4, we summarize our find- 
ings and suggest directions for future CSP research 
based on the DEA methodology. 


2. Literature Review: Measuring CSP 


Because the full spectrum of CSP is broad, generating 
a proxy that can reflect its full scope is challenging. 
Although corporate financial performance indicators 
are clearly defined and readily available (like return 
on assets, return on investment, etc.), the CSP coun- 
terparts are not. Because of the qualitative nature of 
CSP, the assessment of CSP relies mostly on “soft” 
indicators related to management practices, rather 
than the “harder” indicators (e.g., tons of CO) emis- 
sion or of toxic releases). Common CSP measures 
include, for example, labor right protection and the 
transparency of social and environmental perfor- 
mance reporting. Several authors have described the 
challenges associated with measuring CSP (Carroll 
1999, Delmas and Doctori-Blass 2010, Graves and 
Waddock 1994, Wokutch and McKinney 1991). 

The multi-dimensionality of the CSP construct is the 
primary difficulty in measuring CSP. As Hirsch and 
Levin (1999, p. 200) note, CSP is “a broad concept or 
idea used loosely to encompass and account for a 
broad set of diverse phenomena. Rowley and Berman 
(2000) criticize studies that proxy CSP using a single- 
dimensional indicator for two main reasons: a one- 
dimensional indicator cannot represent the full 
breadth of CSP construct (i.e., the validity problem), 
and it makes comparing and unifying different stud- 
ies extremely difficult. 

Recent studies attempt to grapple with this issue by 
using simple linear aggregation of CSP data to create 
an aggregate CSP score for either a specific subset 
of CSP criteria or the entire CSP construct. In spite 
of their ease of implementation, these aggregation 
approaches have suffered from several major draw- 
backs. They often lack general applicability and are 
difficult to interpret in different contexts (Berman 
et al. 1999). We next introduce these approaches and 
describe their limitations. 


2.1. Linear Aggregation Methods 

Academic researchers have measured CSP using sur- 
vey questionnaires, content analyses of annual 
reports, expert evaluations, and regulatory compli- 
ance data (Aupperle 1991, Bowman and Haire 1975, 
Delmas and Toffel 2008, Wolfe 1991, Zahra et al. 1993). 
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Table1 Publications Counts by Journals 
Number of 
Journal titles Authors papers 
Journal of Business Ethics Albinger and Freeman (2000), Ruf et al. (2001), McGuire et al. (2003), Igalens and Gond (2005), 10 
Cho et al. (2006), Bartkus and Glassman (2008), Bouquet and Deutsch (2008), Bird et al. (2007), Chen et al. 
(2008), and Van der Laan et al. (2008) 
Business and Society Griffin and Mahon (1997), Waddock and Graves (1997a), Luce et al. (2001), Backhaus et al. (2002), 8 
Dawkins (2002), Mattingly and Berman (2006), Rehbein et al. (2004), and Shropshire and Hillman (2007) 
Academy of Management Journal Agle et al. (1999), Graves and Waddock (1994), Brown and Perry (1994), Thomas and Simerly (1995), 7 
Turban and Greening (1996), Berman et al. (1999), and Johnson and Greening (1999) 
Strategic Management Journal Waddock and Graves (1997b), Hillman and Keim (2001), and Hull and Rothenberg (2008) 3 
International Journal of Management Kennelly and Lewis (2002) and Simerly (2003) 2 
Journal of Management Ruf et al. (1998), Deckop et al. (2006), and Neubaum and Zahra (2006) 3 
Academy of Management Review Marquis et al. (2007) 1 
Administrative Science Quarterly Briscoe and Safford (2008) 1 
Journal of International Business Studies Strike et al. (2006) 1 
Journal of Management Studies Waldman et al. (2006) 1 
Review of Financial Studies Landier et al. (2009) 1 
Others Webb (2004), Kane et al. (2005), Kempf et al. (2007), Chatterji et al. (2009), and Neiling and Webb (2009) 5 
Total 43 


More recently, several for-profit organizations have 
taken up the task of measuring CSP. These include the 
SAM Group, Inc. (SAM), the Riskmetrics Group, and 
KLD. SAM, for example, gathers CSP information 
such as board structure, ability to manage risk, and 
environmental reporting system = (http://www. 
sam-group.com). The Riskmetrics Group evaluates 
corporate governance, employee and stakeholder man- 
agement, and corporate environmental performance 
(http:/ /www.riskmetrics.com). KLD ratings include 
the following CSP issues: employee relations, diversity, 
community relationships, human rights, the environ- 
ment, governance, and controversial issues (http:// 
www.kld.com) for the period 1991-2007. Up until now, 
the KLD database has been the most commonly used 
database for assessing CSP (Graves and Waddock 
1994, Turban and Greening 1996, Waddock 2003). A 
search for “Corporate Social Responsibility” and 
“KLD” in Google Scholar in May 2009 produced over 
700 hits. 


2.1.1. CSP Studies Using Linear Aggregation Ap- 
proaches. KLD has generated a flourishing literature 
on CSP in prominent academic management journals. 
In Table 1, we tally these papers by journals. The 
results show that the Journal of Business Ethics, Business 
and Society, and the Academy of Management Journal 
have published the largest number of KLD-CSP 
articles. 

The literature uses two main types of aggregation 
methodologies. The first consists of assigning equal 
weight to all categories (community relationships, 
environmental performance, human rights, etc.). For 


example, Hillman and Keim (2001) use the equal 
weights aggregation method because the literature 
“has yet to identify a ranking of importance [of 
different CSP categories] for various stakeholder 
groups and issues” (Hillman and Keim 2001, p. 131). 
By assigning equal weights, however, the researcher 
assumes all indicators are of the same or at least 
similar importance. As Bird et al. (2007) argue, this 
assumption is invalid in most cases. 

The second methodology is to gather information 
on stakeholder preferences in order to assign 
weights to specific CSP categories. Using this 
method, Ruf et al. (1998) generated weights for the 
different KLD dimensions through a survey of pref- 
erences of 101 public officers, executives of non- 
profit organizations, and managerial accountants. 
The respondents considered product/liability issues 
to have the highest weight (23%), followed by em- 
ployee relations (18%), women/minority (15%), 
environmental (14%), and community relations 
(12%). The three social dimensions considered least 
important were nuclear power (7%), military (5%), 
and South Africa (5%) (Ruf et al. 1998). Similarly, 
Waddock and Graves (1997b) developed a weighting 
scheme based on the opinion of three experts from 
the Social Issues in Management division of the 
Academy of Management who had been active in 
the social issues arena for more than 15 years. Em- 
ployee relations were found to be the most important 
category (17%), followed by product/liability issues 
(15%) and community relations (15%), and then the 
environment (14%). Other social issues considered 
include the treatment of women and minorities 
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(13.6%), nuclear power (8.9%), military contracts 
(8.6%), and South Africa (7.6%). 

An analysis of the publication counts by aggrega- 
tion types of the journals listed in Table 1 showed 
that, of the 43 publications, 26 used equal weights 
and nine used unequal weights, whereas the other 
eight studies did not use aggregation. We will argue 
below that these aggregation methods can lead to 
non-robust results. 


2.1.2. Limitations of the Aggregation Approaches. 


The first question with these aggregation approaches is 
whether these weights are justifiable. The answer from 
the literature, however, tends to be unfavorable. Research 
on stakeholder management and social participation has 
pointed out that no universally agreed-upon weights or 
prioritization of social or environmental issues can exist 
for different stakeholder groups in different situations, as 
stakeholder attributes (e.g, stakeholder composition, 
perceptions, and preferences) are dynamic and could 
change over time (Bird et al. 2007, Hillman and Keim 
2001, Mitchell et al. 1997). Even for a specific stakeholder 
group, the current weight elicitation encounters great 
difficulties when evaluating less tangible goods such as 
clean air and noise (Freeman 2003, Kuosmanen and 
Kortelainen 2007). Chatterji and Levine (2006) note that 
major social investment indexes weight the non-financial 
CSP indicators of listed companies quite differently, which 
makes comparing the CSP of firms difficult. Delquié 
(1997) further shows that biases can arise in the elicitation 
of weights. 

Rowley and Berman (2000) highlight several con- 
cerns for the simple weight aggregation approach 
the CSP-corporate financial performance literature 
uses: the aggregate score lacks a simple interpreta- 
tion; the weights are not representative of the trade- 
off between CSP indicators; and when a different 
data source is used (e.g., a new database with or 
without the addition or removal of the original CSP 
criteria), the weights and aggregate scores could lose 
their applicability and comparability. 


2.2. Strengths vs. Concerns 

As noted earlier, the CSP construct represents of both 
positive and negative firm behavior. The KLD data- 
base, which contains both “strength” and “concern” 
indicators for each CSP issue, reflects this trait. Many 
empirical researchers conduct simple aggregation of 
the indicators (e.g., strength scores minus concern 
scores) to create a CSP-item score. Yet Mattingly and 
Berman (2006) have found through a factor analysis 
that the “strength” and “concern” indicators in KLD 
data represent four distinct constructs, and thus 
they should not be combined without “carefully clar- 
ifyling] the social construct that we intend to 
measure” (Mattingly and Berman 2006, p. 41). 


Previous research has found that firms with high 
scores on their strengths also tend to have high scores 
on their concerns, as indicated by the positive corre- 
lation between the KLD strengths and concerns 
(Delmas and Doctori-Blass 2010, Mattingly and Ber- 
man 2006). Simple aggregation methods (subtraction 
of strengths from concerns), however, consider that 
firms with high scores on both strengths and concerns 
are similar to firms with low scores on both strengths 
and concerns. 

In spite of the extensive discussion of the aggrega- 
tion issues we just described, the CSP literature has 
yet to provide empirical researchers with a general 
methodology to tackle all of these criticisms. In this 
paper, we propose a weight-free evaluation approach, 
called DEA, to evaluate CSP from an efficiency per- 
spective. In the following section, we introduce the 
DEA approach. 


3. Data Envelopment Analysis 


Our methodology is based on DEA, which is a mathe- 
matical programming method for evaluating firms’ 
productive efficiency that has been used extensively 
in the operations research and management literature 
(Charnes et al. 1978, Cooper et al. 2006). In the DEA 
methodology, efficient firms are those that use min- 
imal inputs to produce maximum outputs. DEA 
evaluates a firm’s multi-factor performance by a com- 
posite efficiency index with a value between zero and 
one, with “one” representing the efficient firms. It 
does so without the need for explicit weight specifi- 
cations for inputs and outputs. These weights are 
generated automatically through an optimization pro- 
cedure, such that the evaluated firm will be assigned a 
set of “optimal weights” that maximizes the firm’s 
efficiency relative to the other firms in the sample. 
Each firm will therefore receive its most favorable 
weights, and the influence of subjective weightings 
can be eliminated. In this paper, we consider CSP 
concerns as inputs (i.e., factors to be minimized) and 
CSP strengths as outputs (i.e., factors to be maxi- 
mized). Thus the DEA score can account for the trade- 
off between positive and negative CSP indicators. 

Bendheim et al. (1998) utilize the conventional DEA 
model to identify firms’ best practices regarding their 
stakeholder relationship management. In their analy- 
sis, they select output variables as the aggregated 
scores of five CSP categories, whereas dummy vari- 
ables represent input variables (i.e., all firms have the 
same input value). Although the authors use CSP 
issues as outputs, their study does not differentiate 
between strengths and concerns. In contrast, our ap- 
proach uses CSP concerns as inputs and CSP strengths 
as outputs. 
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Figure1 Graphical Illustration of the Data Envelopment Analysis Approach 
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We use an input-oriented DEA model, where the 
objective is to minimize CSP concerns (the inputs) 
given current CSP strengths (the outputs). Figure 1 
illustrates the fundamental mechanism of the DEA 
model. In the figure, we consider one CSP concern 
and one CSP strength. First, DEA constructs the effi- 
cient frontier as a piecewise linear function that 
envelops the observed sample. Subsequently, each 
firm is benchmarked against its unique target located 
on the frontier. The DEA score represents the distance 
between the firm and the efficiency target (e.g., the 
length from O toi * divided by the length from O to îi). 
Firms on the frontier are identified as efficient and 
hence their DEA values are equal to one, whereas 
firms with a score lower than one are considered in- 
efficient (i.e., they should further reduce their concern 
levels). 

However, because we determine the efficiency fron- 
tier based on observed data, small or unrepresentative 
samples can often result in biases in the estimation of 
the efficient frontier; see Podinovski and Thanassoulis 
(2007) for further discussion and possible remedies for 
the problem. 


3.1. DEA Formulations 

Consider n firms under evaluation. In evaluating a 
firm, we consider s desirable criteria and m undesir- 
able criteria; accordingly, we denote the observed 
performance of firm j as yj to yj; and xj tO Xjm, re- 
spectively. Note that we assume these variables have a 
ratio-scale measure. In general, we would consider a 
firm superior if it has higher desirable values than 
other firms, keeping the level of undesirable indicator 
constant; or vice versa. As such, we can construct the 
performance index as 


S m . 
y= Twa / Zloto fat OD 


where the u, and v; in the formula are the weights 
attached to the rth desirable and the ith undesirable 
indicator, respectively. 

As in the classical productivity efficiency index, the 
composite CSP in this formulation is represented as a 
ratio between the aggregated good and bad. The 
weight parameters are assumed to be known and are 
supposed to reflect the relative importance among 
different criteria. A higher index score then indicates 
better CSP, and a lower score indicates worse CSP. 

As noted earlier, however, meaningful weights or 
rankings for CSP indicators are difficult to assess, es- 
pecially for different stakeholder groups. DEA can be 
helpful in addressing this problem. Instead of assign- 
ing fixed weights, DEA allows weights to be variable, 
and the following optimization problem determines 
the weights (for firm 1): 


S m 
Max Jya uir / Dopa ni 
subject to 5 uyy / Y vix; <1, for j=1,...,n, 


u,>0 for r=1,...,s; 


vj>0 for i=1,...,m. (2) 
Model (2) is commonly called the DEA multiplier 
model in the literature. The model will select weights 
that maximize the efficiency of the evaluated firm. The 
first set of constraints standardizes the evaluation re- 
sults such that the efficiency scores of all firms should 
not exceed one. The second set of constraints guaran- 
tees the weights are non-negative. We then solve the 
problem for each evaluated firm by replacing the 
parameter values in the objective function. 

For computational convenience, we can reformulate 
the problem as an equivalent linear programming 
problem by using the Charnes—Cooper transformation 
for fractional linear problems; namely, we replace the 
objective function with 


Max UrYir, (3) 


s 
r=1 


and add a linearizing constraint 


m 
5 vX = 1. (4) 
i=1 


The objective value (3) can be interpreted as the dis- 
tance between the focal firm’s CSP and the best CSP 
performer in the sample. In this case, we can define CSP 
efficiency as the extent to which a firm can reduce its 
current concerns, given its strengths; for example, a score 
of 0.9 means the firm can decrease its overall CSP con- 
cerns by 10% relative to the best practice (ie., CSP- 
efficient firms). Without the exogenous influence of 
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weights, DEA scores can capture the true underlying 
difference in CSP (Charnes et al. 1978, Cooper et al. 2006). 

The dual linear programming problem of the mul- 
tiplier model (2) is generally referred to as the 
envelopment model: 


Min 0; 


n 

subject to So Aix < Qix, for i1=1,...,m, 
j=l 

(5) 


Ayjr > Yir, for r=1 s 


1 


pees 


n 
? 


J 
Ap >0, for PSA, 2s, 

The envelopment model (5) minimizes the contrac- 
tion ratio 0; such that the evaluated firm can become 
CSP efficient after contraction (i.e., finding its bench- 
mark on the frontier). This interpretation corresponds 
to the graphical illustration in Figure 1. 

Traditional DEA models, however, assume the in- 
put and output variables are in the ratio or interval 
scale. They are therefore not appropriate for the KLD 
data, which consists of data that only have meaning in 
the ordinal scale. Specifically, we know a firm that 
scores three points in the environmental strength cat- 
egory is not “three times better” than a firm that 
scores one point in the same category—these scores 
can only be appropriately compared in the ordinal 
scale. To maintain research validity, scientific analysis 
should therefore be carried out in concordance with 
the measurement scale of observations. Cook and Zhu 
(2006) developed an extension of the DEA model for 
ordinal input and output variables, that is, variables 
measured in Likert scale. We therefore adopt the DEA 
model, Cook and Zhu (2006) developed for ordinal 
data. The Cook and Zhu model is developed based on 
the extension of the classical DEA model, but it in- 
volves a higher level of mathematical detail; see Zhu 
(2003) and Cook and Zhu (2006). 


3.2. Identifying Benchmark Targets Using DEA 

Scores from the DEA model can be interpreted as the 
reduction ratio of concern levels necessary for the firm 
to become CSP efficient, because our DEA model is 
input oriented. We therefore seek to reduce concerns, 
given the firm’s current strengths. We provide an ex- 
ample in Table 2. The KLD scores are presented in the 
first column. The benchmark KLD scores are pre- 
sented in the second column, and we obtain them by 
multiplying the KLD score by the DEA score (here 
0.9475). One problem is that the benchmark efficiency 
score can be fractional. For example, in Table 2, the 
target of the community concern is 3.8. One way to 
interpret the score in terms of changes for CSP vari- 
ables is to round the fractional number down to its 
nearest integer. Column 3 in Table 2 presents the 


Table2 Example Kinder, Lydenberg, and Domini, Inc. (KLD) Strength, 
Concern Rankings, and the DEA Benchmark 


Efficiency score = 0.9475 


Input-oriented benchmark 


KLD target (KLD score x Rounded 
CSP issues score efficiency score) target 
Concern 
Community 4 3.8 3 
Diversity 2 1.9 1 
Employee relations 3 2.8 2 
Environment 6 5.7 5 
Human rights 2 1.9 1 
Product and services 3 2.8 2 
Strength 
Community 2 2 2 
Diversity 4 4 4 
Employee relations 3 3 3 
Environment 2 2 2 
Human rights 1 1 1 
Product and services 1 1 1 


CSP, corporate social performance. 


rounded benchmarks for all concerns. If we wanted to 
find out how much a firm needs to increase its 
strengths to reach efficiency, we would need to com- 
pute an output-oriented DEA model. 

Several studies have used DEA or its variants to 
assess the environmental performance of firms (see 
Zhou et al. 2009 for a survey). What is similar in 
evaluating environmental performance and CSP is 
that we need to consider both desirable and undesir- 
able performance, for example, electricity generated 
and the greenhouse gas emissions from a utility gen- 
eration plant. Compared with CSP, the environmental 
data that these studies use are more directly observ- 
able, such as energy consumption, pollutants emitted, 
and some financial or economic measures. Our paper 
is thus distinct from previous DEA studies of envi- 
ronmental issues in that we account for the qualitative 
and ordinal nature of CSP data, namely, that CSP 
variables are related to management practices rather 
than actual performance outputs and are measured on 
an ordinal scale. However, we should note that the 
KLD database also considers some quantitative data 
in its qualitative assessment of firm performance (e.g., 
information from the toxic release inventory by the 
Environmental Production Agency). 


4. Empirical Analysis 

In this section, we present the KLD data and the ag- 
gregation methodologies used in the literature. We 
then compare the composite CSP scores we obtained 
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from the DEA model with those from the weight ag- 
gregation methods the literature reports. 


4.1. CSP and KLD 
Several studies in the literature have criticized the in- 
dicators used in empirical studies (including the KLD 
database) as not fully grounded in the theoretical 
development of CSP, and using a fixed set of measures 
presupposes a “one-size-fits-all” property of CSP for 
different industries (see, e.g., Gond and Crane 2010, 
Mattingly and Berman 2006, Rowley and Berman 
2000). Yet in view of the vague boundary and com- 
plexity of CSP, the KLD database has been deemed 
“the de facto research standard at this moment” and 
“the best currently available to scholars” (Waddock 
2003, pp. 369 and 371). Its 2007 version includes the 
CSP assessment of the 3000 largest US publicly traded 
companies over 21 CSP issues, which can be classified 
into four major CSP dimensions: environmental, 
social, governance, and controversial business in- 
volvement ratings. 

KLD evaluates each of the CSP issues by a number 
of concern and strength variables, and these variables are 
coded as binary variables. For example, in the climate 


Figure 2 


1-1. 
Level 1: Categories Environmental 


performance 


Level 2: Issues. 2-1 
e Climate change 
e Product and services 
e Operations & management 
e Other factors 


Level 3: Items Concerns 


Climate change 

Ozone depleting chemicals 
Agricultural chemicals 
Hazardous waste 
Regulatory problems 
Substantial emissions 


Strengths 
e Clean energy 


Pollution prevention 
Recycling 
Management systems 


Beneficial products & services 


change issue, “taking significant measure to reduce 
emissions by using clean energy” is considered a 
strength, whereas “deriving significant profits from 
the sale of fossil fuels and their derivatives” is regarded 
as a concern. A team of experts from KLD investigates, 
using a variety of data sources, how a firm can score on 
individual KLD strength and concern variables. Their 
sources include direct communication with the com- 
pany managers, public documents, government and 
NGO information, and media reports (see http://www. 
kld.com for a complete description). 

As do most studies in the literature, we exclude in 
our analysis the nine CSP issues under the “controver- 
sial business involvement” umbrella, as no theory or 
evidence yet supports their roles in the CSP research 
(Berman et al. 1999, Turban and Greening 1996). The 
nine CSP issues include abortion, adult entertainment, 
alcohol, contraceptives, firearms, gambling, military, 
nuclear power, and tobacco. In what follows, we will 
describe the main variables included in the database 
and how the literature has aggregated the KLD data. 
Figure 2 illustrates the structure of the KLD database, 
which includes three main categories: environmental 
performance, social ratings, and governance ratings. 


Illustration of the Kinder, Lydenberg, and Domini, Inc. Structure Ver. 2007 


CSP 
1-2. Social re 
í Governance 
ratings ; 
ratings 
2-2 
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e Human rights e Other 
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Strengths e Political 


e Charitable giving accountability 
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e Support for e Transparency 

education e Compensation 
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Table3 Kinder, Lydenberg, and Domini, Inc. (KLD) Statistics 


Number of variables Concerns Strength 
Category Issues Concern Strength Mean SD Mean SD 
Environmental performance Climate change 1 1 0.048 0.213 0.034 0.182 
Product and services 2 1 0.007 0.09 0.019 0.136 
Operations and management 3 3 0.138 0.479 0.07 0.313 
Others 1 1 0.007 0.09 0.019 0.136 
Social ratings Community 4 7 0.111 0.335 0.117 0.447 
Diversity 3 8 0.431 0.513 0.606 1.046 
Employee relations 5 6 0.524 0.71 0.275 0.6 
Human rights 4 3 0.046 0.232 0.005 0.069 
Product 4 4 0.232 0.575 0.044 0.216 
Governance ratings Reporting 2 2 0.001 0.026 0.035 0.199 
Structure 3 2 0.356 0.483 0.155 0.362 
Other 1 1 0.036 0.186 0.002 0.045 


Within each of these main categories, several issues are 
considered, such as climate change and operations and 
management within the environmental performance 
category. A number of concern and strength binary 
variables (i.e., the variable is equal to one if the firm 
meets the criteria of the concern or strength variable 
and equal to zero otherwise) subsequently represent 
each issue. For example, in determining the climate 
change concern item for the environmental performance 
category, the team of KLD experts will use various data 
sources to assess whether “The company derives substan- 
tial revenues, directly or indirectly, from the sale of coal or oil 
and its derivative fuel products.” If the KLD team con- 
cludes the evaluated firm satisfies the above 
description, this firm’s climate concern score is one; 
otherwise, the score will be zero. We provide a partial 
list of concern and strength items in Figure 2; see 
http://www.kld.com for the full list of these items and 
their definitions. 


4.2. Data and Methods 

We utilize the 2007 KLD data, which contain the CSP 
ratings of around 3000 of the largest publicly traded 
firms in the United States. Table 3 gives the descrip- 
tive statistics at the issue level (e.g., climate change, 
diversity, human rights, and so on), which we use as 
the basis for subsequent calculation. 

The empirical and conceptual CSP literatures have 
both reiterated the substantial influence of industrial 
effects on the analysis of CSP (Griffin 2000, McWil- 
liams and Siegel 2000, 2001, Waddock and Graves 
1997b). To take into account the industry effect, we 
classified the 2007 sample according to the first two 
digits of the SIC code (see Table 4). Our analysis fo- 
cuses on the three largest industries in the 2007 
sample: manufacturing, finance, and service industries. 


In the subsequent analysis, we use the three most 
widely used CSP weighting schemes. These are shown 
in Table 5 and include (i) equal weights, (ii) weights 
derived from expert opinions (Waddock and Graves 
1997b), and (iii) weights derived from survey of public 
affairs officers, executives of non-profit organizations, 
and managerial accountants (Ruf et al. 1998). 

However, as the KLD database has updated the 
evaluated CSP items and the number of strength and 
concern variables over the years, the weights Ruf et al. 
(1998) and Waddock and Graves (1997b) developed 
no longer match the 2007 version of the KLD da- 
tabase.' In order to compare our results from DEA 
with those of these previous methods, we choose to 
remove the corporate ratings category from the sample 
because neither Ruf et al. nor Waddock and Graves 
provided a weight for this CSP category. For the same 
reason, we also combine the four issues of the cate- 
gory environmental performance (see Figure 2). 


Table 4 SIC Industry Classification of the Kinder, Lydenberg, and Domini, 
Inc. (KLD) 2007 Sample 


Industry Number of firms 
Manufacturing 1072 
Finance, insurance, and real estate 661 
Services 457 
Retail trade 192 
Mining 132 
Transportation, communications, electric, gas, and 288 
sanitary services 

Wholesale trade 79 
Construction 37 
Public administration 12 
Agriculture, forestry, and fishing 6 
Total 2936 
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Table5 The Aggregation Weights 


Equal Ruf et al. Waddock and 
Category weights (1998) Graves (1997b) 
W1-community 0.111 0.125 0.148 
W2-diversity 0.111 0.152 0.136 
W3-employee relations 0.111 0.183 0.168 
W4-environment 0.111 0.141 0.142 
W5-human rights 0.111 0.152 0.136 
W6-product and services 0.111 0.228 0.154 
W7-nuclear power 0.111 0.089 0.074 
W8-military contract 0.111 0.086 0.050 
W9-South Africa 0.111 0.076 0.046 


With these weights, we calculate the KLD-CSP 
score according to the formula 


KLD—CSP score j = X. p;(yji — xji). (6) 
i=1 


Yji and xj; denote firm j’s number of strengths and 
concerns in CSP category i, respectively; p; is the 
weight for category i. In the formula, we first calculate 
the category score by subtracting concerns from 
strengths. Then we can obtain the KLD-CSP score 
simply as the weighted sum of the category scores. 
Note that the fixed-weight approach only applies 
weighting to CSP categories; the concern and strength 
variables are merged by subtraction. This research 
design implies that any concern matters as much as 
any strength in a category. 

By contrast, we do not need to impose such an 
assumption in the DEA model. In the empirical appli- 
cation, we consider separately the strength and concern 
levels of the six CSP categories that correspond to the 
W1-W6 categories in Table 5 (so in total the DEA model 
uses 12 variables). We follow the ordinal measurement 
scale of KLD data, and the DEA program determines the 
relative weights for different categories as described in 
section 3. We conduct the DEA analysis independently 
for the three industries considered. 


4.3. Results 

We apply both the DEA method and the three weight- 
ing schemes (i.e., equal weights and those developed 
by Ruf et al. 1998 and by Waddock and Graves 1997b) 
to the KLD 2007 dataset. Table 6 shows the descriptive 
statistics of the scores from the different approaches. 
We obtained the scores for the three weighting 
schemes by applying the weights from Table 5 to 
Equation (6). We obtained the DEA scores from the 
DEA model for ordinal data (Cook and Zhu 2006), 
which is an extension of the general DEA model. We 
refer the readers to Zhu (2003) and Cook and Zhu 
(2006) for more details about the model. 


Table6 Descriptive Statistics of Corporate Social Performance (CSP) 


Scores 
Equal Ruf et al. Waddock and Rank 

Industry weights (1998) Graves (1997b) DEA 
Manufacturing (n = 1072) 

Mean — 0.0661 — 0.6181 — 0.0454 0.9695 

SD 0.2780 0.3810 0.3499 0.0076 

Minimum — 0.9999 — 1.2226 — 1.0720 0.9475 

Maximum 1.6667 2.2069 2.1000 1.0000 
Finance (n = 661) 

Mean — 0.0108 — 0.0241 — 0.0052 0.9857 

SD 0.1944 0.2609 0.2429 0.0059 

Minimum — 0.6666 — 0.9000 — 0.6080 0.9700 

Maximum 0.7777 1.1638 1.1440 1.0000 
Services (n = 457) 

Mean — 0.0557 — 0.0480 — 0.0335 0.9797 

SD 0.2219 0.3237 0.2835 0.0053 

Minimum — 0.6667 — 0.9757 — 0.7440 0.9675 

Maximum 1.3333 2.2686 1.9980 1.0000 


DEA, data envelopment analysis. 


Figure 3 provides the distribution plots of KLD 
scores. These figures in general appear to be bell- 
shaped, although none of them can pass the Shapiro- 
Wilk test for normality at the 1% significance level. 

From the DEA scores, we can obtain additional 
insights. For all three industries, only a small propor- 
tion of the firms are CSP-efficient with a score of 
one (manufacturing: 3.17%; finance: 4.08%; service: 
2.63%). However, the average DEA efficiency score 
from our sample is 0.976; this indicates that on aver- 
age firms can reduce their CSP concern levels by 2.4%, 
given their current level of CSP strengths. Obtaining 
high average efficiency scores in empirical DEA ap- 
plications is not uncommon. For instance, Majumdar 
and Marcus (2001) report an average efficiency score 
of 0.78 with a standard deviation of 0.24. Similarly, 
Goto and Tsutsui (1998) report an average efficiency 
score of 0.90 for US utilities for 1984-1993. The high 
average efficiency score in our current study is also 
due to the low level of variations in the KLD data. 
Table 7 contains the percentages of firms with the 
highest strength scores (e.g., five out of five possible 
points) and the lowest concern scores (e.g., zero out of 
seven possible points) in the six KLD concern and 
strength categories. For most CSP issues (e.g., com- 
munity, environment, and human rights), we observe 
a high proportion of firms that attained the highest 
strength and the lowest concerns scores. This finding 
suggests a majority of firms in the KLD sample are 
located in the vicinity of the efficient frontier and ex- 
plains why we have a high average efficiency score. 

From the statistics Table 6 reports, we can see that 
firms from the financial sector are in general more 
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Figure3 2007 Kinder, Lydenberg, and Domini, Inc. Scores Using Weighted Aggregation 
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DEA score: manufacturing industry 


CSP efficient than firms from the manufacturing and 
service industries. The mean of the efficiency score 
of the financial sector is higher than the two other 
sectors. Figure 3 gives a good indication that the CSP 
efficiency frontiers of the manufacturing and service 
industries are defined by relatively few leading firms, 
whereas the majority of other firms in the sample are 
lagging behind. 

Our results also illustrate that previous aggregation 
approaches cannot always sort out CSP-efficient 
firms. In Table 8, for example, we list the 34 CSP- 
efficient firms (i.e., DEA score equal to one) and their 


DEA score: Finance industry 


DEA score: Service industry 


ranks with the weighted aggregation scores. In Table 
9, we include the list of the weighted aggregation 
scores of the 34 firms from the bottom of DEA rank- 
ings. In Table 8, although firms with the highest 10 
aggregation scores are also CSP efficient, many CSP- 
efficient firms are ranked below 20%. 

The primary reason for the difference in the rank- 
ings across methodologies is that, in the aggregation 
approaches, firms’ CSP scores depend on the firm 
performance within specific CSP categories, whereas 
DEA considers individual CSP concerns and strengths 
and allows for compensation across concerns or 
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Table7 Percentages of Firms in Our Sample With the Highest Strength 
Scores and the Lowest Concern Scores by Kinder, Lydenberg, and 
Domini, Inc. (KLD) Category* 


Table8 Corporate Social Performance (CSP) Efficient Firms in the Man- 
ufacturing Industry and their Ranks (n = 1072) 


Ranking 
% of firms in the % of firms in % of firms in Ranking according Ranking Ranking 
KLD concern manufacturing the finance the service according to Waddock according according 
and strength industry with the industry with industry with to Ruf etal. and Graves to equal to DEA 
variables best scores the best scores the best scores Firm names (1998) (1997b) weights scores 
KLD concern 3M Company 26 35 26 1 
Community 92.07 83.18 95.84 Advanced Micro Devices, 11 11 9 1 
Diversity 56.44 62.33 57.55 Inc. 
Employee relations 52.71 77.76 60.39 Agilent Technologies, Inc. 2 3 2 1 
Environment 81.34 99.39 99.56 Alcoa, Inc. 524 718 254 1 
Human rights 95.24 98.94 96.28 Applied Materials, Inc. 10 9 9 1 
Product and services 82.65 83.36 85.12 Avon Products, Inc. 12 12 15 1 
KLD strength Bristol-Myers Squibb 62 25 55 1 
: Company 
Community 92.16 84.72 96.72 Guat a a 707 : 
Diversity 66.04 66.41 60.61 aie ao ets 
Dell, Inc. 30 18 55 1 
Employee 72.67 81.85 86.21 Ka ERT Ji oi i ; 
Environment 83.58 99.39 97.81 eta apay 
, Ford Motor Company 55 31 32 1 
Human rights 99.44 99.70 99.78 
: General Mills Incorporated 3 4 6 1 
Product and services 92.91 97.88 97.16 
General Motors Corpora- 162 63 59 1 
Average 80.60 86.24 86.09 tion 
*This table displays the percentage of firms with the highest strength scores Graco, Inc. 52 83 59 1 
(e.g., five out of five possible points) and the lowest concern scores (e.g., zero out Green Mountain Coffee 8 8 4 1 
of seven possible points). Roasters, Inc. 
Harley-Davidson, Inc. 86 93 130 1 
Herman Miller, Inc. 9 10 9 1 
strengths in different categories. The DEA model will Hewlett-Packard 4 2 5 1 
seek the optimal trade-offs between different concerns Company i 
and strengths for the evaluated firm (i.e, weights Intel Corporation 1 1 1 1 
attached to concerns and strengths). With the Johnson & Johnson 7 16 9 1 
aggregation methods, firms will tend to receive low Kraft Foods, Inc. 21 27 17 1 
aggregation scores if they underperform in specific Lilly (Eli) and Company 65 34 22 1 
categories (i.e., high concern and low strength in the Mattel, Inc. 87 58 33 1 
same category). For example, if a firm has a high Molex Incorporated 80 142 141 1 
strength and a high concern within a specific category, Motorola, Inc. 5 5 8 1 
the final score will still be average because strengths Ke, inc. 35 19 9 1 
and oe eee soe eee out. n PepsiCo, Inc. 108 81 26 1 
In or er to un erstan 1 erences 1n ran ing, we Procter & Gamble 18 24 17 1 
describe in detail the ranking of three firms: Intel, Company 
Alcoa, and Coca-Cola, presented in Table 8. These Steelcase, Inc. 14 17 15 1 
three firms are efficient with the DEA approach. In the 
x y Texas Instruments Incor- 7 7 7 1 
aggregation method, however, only Intel obtains a porated 
high ranking (1). Alcoa and Coca-Cola obtain lower Timberland Company 15 15 14 1 
rankings (from 200 to 700 for both). Looking at the (The) 
individual scores for each of these firms can help us yalero Energy Corporation 258 280 707 1 
ee this seus (see ee ae i Waters Corporation 85 137 141 1 
ntel receives low concerns an igh strengths Xerox Corporation 6 6 3 1 


across different KLD categories in Table 10. There- 
fore, Intel is efficient in the DEA model. The fifth 
column in the table shows the maximum score in the 
sample for individual variables. Alcoa and Coca-Cola 
obtain lower scores in the aggregation methodology. 
Alcoa has a comparatively low score in the diversity 


DEA, data envelopment analysis. 


category, and Coca-Cola has a low score in the prod- 
uct category (see Table 10). We can trace their low 
performance in these categories back to their original 
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Table9 Some Corporate Social Performance (CSP) Inefficient Firms in the 
Manufacturing Industry and their Ranks (n = 1072) 


Table 10 Original Kinder, Lydenberg, and Domini, Inc. (KLD) Concern and 
Strength Scores 


Ranking Ranking 
according according Ranking Ranking 

to Ruf to Waddock according according 

et al. and Graves to equal to DEA 
Firm names (1998) (1997b) weights scores 
Exxon Mobil Corp. 1069 1068 1071 1072 
Tyson Foods, Inc. 1071 1072 1070 1072 
Cintas Corporation 1072 1070 1049 1070 
Covidien Ltd. 1062 1061 1071 1070 
Exide Technologies 1060 1066 1066 1070 
Koppers Holdings, Inc. 1065 1060 1049 1070 
Smithfield Foods, Inc. 1064 1067 1049 1070 
Brunswick Corporation 1066 1059 1066 1065 
Bunge Limited 1067 1069 1049 1065 
Celanese Corporation 1070 1071 1069 1065 
FMC Corporation 1053 1046 1030 1065 
Goodyear Tire & Rubber 1056 1052 1049 1065 
Grace (W.R.) & Co. 1048 1047 1049 1065 
McDermott Intl, Inc. 1063 1065 1049 1065 
Pilgrim’s Pride Corp. 1068 1063 1049 1065 
Archer-Daniels-Midland 1054 1054 1066 1057 
Chemtura Corporation 1052 1054 1049 1057 
Crown Holdings, Inc. 1061 1051 1049 1057 
Hercules Incorporated 1021 1021 1030 1057 
Ingersoll-Rand Com- 1025 1033 990 1057 
pany 
NL Industries, Inc. 1024 1020 990 1057 
Seaboard Corporation 1058 1062 1049 1057 
ConocoPhillips 1057 1064 1049 1057 
Abitibi Bowater, Inc. 1049 1053 1030 1048 
AK Steel Holding Corp. 1050 1058 1049 1048 
Carolina Group 1059 1034 1030 1048 
Caterpillar, Inc. 1038 1027 1030 1048 
Cytec Industries, Inc. 1046 1024 1030 1048 
Honeywell Intl, Inc. 954 968 990 1048 
Huntsman Corporation 1026 1019 1030 1048 
L-3 Com., Inc. 1039 1024 990 1048 
Masco Corporation 1040 1034 1030 1048 
Mueller Water Products 1040 1034 990 1048 
Murphy Oil Corporation 1045 1056 1049 1048 


DEA, data envelopment analysis. 


KLD scores in these two categories in Table 10. Alcoa 
and Coca-Cola, however, both attain the efficiency 
status in the DEA model. As noted, DEA allows for 
making trade-offs between concerns or strengths. For 
example, Intel is low in its diversity strength. Then 
the DEA model will tend to reduce the weight for the 
diversity strength and assign a higher weight to the 
environmental strength, in which Alcoa excels. Alcoa 
and Coca-Cola are both considered efficient because 


Highest score 


in sample 
Coca- (lowest in the Exxon 
CSP items Intel Alcoa Cola parentheses) Mobil 
COM-con-# 1 2 1 3 3 
COM-str-# 3 2 2 4 1 
Total strengths minus 2 0 1 4(-2) —2 
total concerns 
DIV-con-# 0 0 0 2 1 
DIV-str-# 5 1 
Total strengths minus 5 1 6 (— 2) 
total concerns 
EMP-con-# 0 2 1 
EMP-str-# 5 2 0 
Total strengths minus = 5 0 —1 5 (-— 3) 0 
total concerns 
ENV-con-# 1 4 1 5 5 
ENV-str-# 3 3 2 4 1 
Total strengths minus 2 -1 1 4(-4) —4 
total concerns 
HUM-con-# 0 1 2 3 1 
HUM-str-# 0 0 1 1 0 
Total strengths minus 0 -1 -1 1(-3) — 
total concerns 
PRO-con-# 1 0 3 4 2 
PRO-str-# 1 0 0 
Total strengths minus 0 0 —3 2 (-4) — 


total concerns 


CSP, corporate social performance. 


their relative leading status in certain concern or 
strength items is enough to cover those items in which 
they lag behind. 

In contrast to Table 8, Table 9 lists the weighted 
aggregation scores of the firms in the bottom 34 firms 
of the DEA rankings. The results in Table 9 show that 
the DEA rankings are more consistent with those of 
the previous approaches than they are for efficient 
firms. This observation is not surprising because most 
of these companies score high on the concerns and 
low on their strengths. 

We take the example of Exxon Mobil, which is 
ranked below 1000 in the aggregation and DEA ap- 
proaches. Table 10 shows Exxon Mobil’s KLD scores. 
Exxon Mobil has the lowest scores for the community 
and environment categories in the entire sample, and 
its scores for other categories are also relatively low. 
Hence Exxon Mobil has low rankings with the aggre- 
gation scores. Regarding the KLD disaggregated 
scores at the “item” level, Exxon Mobil scores low 
on 10 out of 12 items, which is why Exxon Mobil also 
receives a low DEA ranking. 
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4.4. Statistical Comparisons of CSP-KLD Scores 

In this section, we run three different tests to compare 
distribution of the CSP scores the DEA and weighted 
aggregation approaches generate. We first use the Wil- 
coxon test for medians, then the Levene test for 
equality of variance, and finally the Kendall’s tau rank 
correlation, which measures the degree of correspon- 
dence between the two rankings. We find that DEA 
differs from the aggregation approaches in terms of its 
median and variance at the 1% significance level. 
Likewise, the aggregation methods also differ along 
these two parameters. We find a positive and signifi- 
cant correlation between DEA and the aggregation 
methods. However, the correlation is stronger among 
the aggregation methods. 


4.4.1. Test for Median and Variance. We use the 
non-parametric Wilcoxon matched-pairs signed-ranks 
test to examine the difference in the medians of the 
scores using fixed-weight specifications and DEA. In 
total, we perform the test for six different pairs of 
scores. Results of the pairwise comparisons indicate 
all KLD-CSP distributions are significantly different 
at the 1% significance level. We found the DEA scores 
have statistically dissimilar distributions than the 
three types of aggregation scores (i.e., all p-values 
are close to 0). The finding implies the KLD-CSP 
distributions are sensitive to different weight con- 
figurations. Future studies should pay attention to this 
sensitivity. 

Next we used the Levene test to assess the equality 
of variances. The Levene test is robust under non- 
normality. For all pairs of score distributions, the test 
rejects that any pair of score distribution has 
equal variance at the 1% significance level. This re- 
sult indicates that the distributions of weighted 
aggregations scores tend to have different variances. 
We find that, when combined with the Wilcoxon test 
results, the weighted aggregation approaches could 
produce sensitive distributions even with small 
changes in weights. 


4.4.2. Correlation Test. The Kendall’s tau coeffi- 
cient is a non-parametric statistic commonly used to 
measure the degree of correspondence between two 
rankings. Table 11 shows the Kendal’s tau correlation 
coefficients. 

All Kendall’s tau coefficients in the table are sig- 
nificant at the 1% significant level. The coefficients 
reveal that the ranking of the three aggregation 
scores is more highly correlated with each other 
(from 0.77 to 0.91) than with the ranking of DEA 
scores (around 0.49). Among the three aggregation 
approaches, rankings based on Ruf et al. (1998) and 
Waddock and Graves (1997b) are more strongly cor- 
related because the aggregation weights they used 


Table11 Kendall’s Tau Rank Correlations for the Pooled Sample (n = 2190) 
Ruf et al. | Waddock and Equal DEA 
(1998) Graves (1997b) weights score 
Ruf et al. (1998) 1 
Waddock and Graves (1997b) 0.9097* 1 
Equal weights 0.7817* 0.7744* 1 
DEA score 0.4977* 0.4925* 0.4940* 1 


DEA, data envelopment analysis. 
“significant at the 1% significance level. 


are similar (see Table 5). The correlation between the 
DEA ranking and other rankings is not as high 
mainly because of the methodological difference ex- 
plained in the previous section. 


5. Discussion and Conclusion 


The aggregation methodology of multiple CSP metrics 
that most of the literature has adopted poses major 
methodological challenges to researchers: the aggre- 
gation score lacks comparability and interpretability, 
and it ignores the ordinal nature of CSP data. We 
tested the validity of the traditional CSP aggregation 
approach using the 2007 KLD data and found this 
approach exhibits statistically different distributions 
when different weights are applied. Specifically, 
we analyzed the mean and variance of the weighted 
aggregation scores and found that minor changes in 
the aggregation weights could lead to significant 
changes in these two important parameters of score 
distributions. 

In this paper, we provided an effective methodol- 
ogy to circumvent these issues. We took an efficiency 
perspective and utilized the DEA model for ordinal 
data to create a single CSP efficiency index from the 
KLD data. In our model, the strengths and concerns of 
CSP are separate components of a firm’s composite 
CSP index. Our application is distinct from the eco- 
efficiency studies that deal with environmental im- 
pacts rather than CSP data. Our DEA approach allows 
us to incorporate “soft” data represented on an ordi- 
nal scale. The DEA approach has several advantages 
over the linear aggregation methods: it does not re- 
quire a priori weight rankings or specifications for 
CSP criteria; the DEA score has a direct interpretation 
(what is your CSP compared with the best in class?); 
and it can be applied to a diverse set of measures of 
CSP. For example, the DEA method can include soft 
and hard measures of performance. Researchers can 
also expand it to compare eco-efficiency to productive 
efficiency (Chen et al. 2010). 

For the empirical CSP literature, the DEA efficiency 
score also provides an ideal proxy for CSP in eco- 
nometric models as a dependent or independent 
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variable (e.g., Delmas and Tokat 2005, Delmas et al. 
2007, Simar and Wilson 2007). In many CSP studies, 
researchers are interested in understanding the rela- 
tionship between CSP and some exogenous variables, 
or the relationship between the CSP components. So 
we should remember that the KLD data, and hence 
the resultant DEA scores, are meaningful in the ordi- 
nal sense only, and this measurement scale should be 
carried over in the subsequent analysis. Thus, for ex- 
ample, ordinal regression techniques may be more 
appropriate when the KLD scores are regarded as the 
dependent variable. 

The output from our DEA model, the efficiency 
scores, can also help devise strategies at the opera- 
tional level to improve the firm’s CSP. For example, a 
CEO could refer to the efficient benchmark and make 
resource and operation planning decisions to improve 
the firm’s CSP. As many companies are paying greater 
attention to the CSP of their supply chains, such com- 
panies can use our model to benchmark their supply 
chain partners’ CSP. Information such as efficiency 
scores and benchmark targets can prove useful in 
supplier base management and in monitoring the CSP 
of the company’s supply chain. 

Finally, we provide some limitations of the current 
study and suggestions for extension. The standard 
DEA model requires the evaluated firm to minimize 
inputs or maximize outputs proportionally to reach 
the efficient frontier. Recently, researchers have devel- 
oped DEA models that allow for non-proportional 
changes in inputs or outputs (e.g., Cooper et al. 2006, 
Chapter 4). Further research could develop similar 
models amenable to ordinal data. 

In this paper, we did not impose weight restriction 
constraints on the u, and v; in the DEA model (2). 
However, the evaluator may have personal prefer- 
ences regarding CSP issues, or face an exogenously 
given rule on how to weight different CSP issues. 
When priorities among CSP issues are precisely artic- 
ulated, DEA is less instrumental for CSP evaluation 
(e.g., environmental strength must be of equal im- 
portance to social strength). However, when the pri- 
oritizing relationship is fuzzy or more flexible, we 
may apply weight restrictions to DEA to reflect the 
evaluator’s specific preferences for different CSP cri- 
teria. For example, the evaluator can determine that 
the weight for the environmental strength in the DEA 
model must be no less than the weight for the social 
strength in the DEA model. When the data lack vari- 
ability or the sample is not sufficiently large, imposing 
weight restrictions can in general increase the dis- 
crimination of DEA results (ie., a wider range of 
efficiency scores). See chapter 6 of Cooper et al. (2006) 
for a general discussion on weight restrictions in DEA, 
and Cook and Zhu (2006) for the exact implementa- 
tion formulation. 


Because of our methodological focus, we only use the 
cross-sectional 2007 KLD data in this paper. However, 
we can expect a firm’s current CSP score to influence its 
future CSP scores. Moreover, the intensity and property 
of this dynamic effect can differ for strengths and con- 
cerns. Thus future research can conduct longitudinal 
analysis and investigate the dynamic interrelationships 
between concerns, strengths, and CSP over time, which 
can reveal further insights into the evolution of CSP 
(Chen and van Dalen 2010). Finally, another interesting 
direction is to combine CSP information with financial 
performance measures to form a more comprehensive 
corporate performance evaluation using DEA. 


Note 


1In 1996, KLD removed the property, plant, equipment item 
from the environmental performance category, and, in 1999, 
it added a climate change item to the environmental per- 
formance category. In 2005, KLD added the governance 
rating category. In 2006, it added a management systems 
strength item to the environmental performance category. 


References 


Agle, B., R. Mitchell, J. Sonnenfeld. 1999. Who matters to CEOs? An 
investigation of stakeholder attributes and salience, corporate 
performance, and CEO values. Acad. Manage. J. 42(5): 507-525. 


Albinger, H., S. Freeman. 2000. Corporate social performance and 
attractiveness as an employer to different job seeking popula- 
tions. J. Bus. Ethics 28(3): 243-253. 

Atasu, A., V. D. R. Guide Jr, L. N. Wassenhove. 2008. Product reuse 
economics in closed-loop supply chain research. Prod. Oper. 
Manag. 17(5): 483-496. 

Aupperle, K. E. 1991. The use of forced choice survey procedures in 
assessing corporate social orientation. Res. Corp. Soc. Perform. 
Policy 12: 269-280. 

Backhaus, K. B., B. A. Stone, K. Heiner. 2002. Exploring the rela- 
tionship between corporate social performance and employer 
attractiveness. Bus. Soc. 41(3): 292-318. 

Bartkus, B., M. Glassman. 2008. Do firms practice what they preach? 
The relationship between mission statements and stakeholder 
management. J. Bus. Ethics 83(2): 207-216. 

Bendheim, C. L., S. Waddock, S. Graves. 1998. Determining best 
practice in corporate-stakeholder relations using data envelop- 
ment analysis. Bus. Soc. 37(3): 305-338. 

Berman, S., A. Wicks, S. Kotha, T. Jones. 1999. Does stakeholder 
orientation matter? The relationship between stakeholder man- 
agement models and firm financial performance. Acad. Manage. 
J. 42(5): 488-506. 

Bird, R., A. D. Hall, F Momente, F. Reggiani. 2007. What corporate 
social responsibility activities are valued by the market? J. Bus. 
Ethics 76(2): 189-206. 

Bouquet, C., Y. Deutsch. 2008. The impact of corporate social perfor- 
mance on a firm’s multi-nationality. J. Bus. Ethics 80(4): 755-769. 

Bowen, F. E., P. D. Cousins, R. C. Lamming, A. C. Faruk. 2001. The 
role of supply management capabilities in green supply. Prod. 
Oper. Manag. 10(2): 174-189. 

Bowman, E. H., M. Haire. 1975. A strategic posture toward corpo- 
rate social responsibility. Calif. Manage. Rev. 18(2): 49-58. 

Briscoe, F., S. Safford. 2008. The Nixon-in-China effect: Activism, 
imitation, and the institutionalization of contentious practices. 
Adm. Sci. Q. 53(3): 460—491. 


Chen and Delmas: Measuring Corporate Social Performance with DEA 


Production and Operations Management 20(6), pp. 789-804, © 2010 Production and Operations Management Society 803 


Brown, B., S. Perry. 1994. Removing the financial performance halo 
from fortune’s “most admired” companies. Acad. Manage. J. 
37(5): 1347-1359. 

Carroll, A. B. 1999. Corporate social responsibility: Evolution of a 
definitional construct. Bus. Soc. 38(3): 268-295. 

Carter, C. R. 2008. Purchasing and social responsibility: A replica- 
tion and extension. J. Supply Chain Manag. 40(4): 4-16. 

Charnes, A., W. Cooper, E. Rhodes. 1978. Measuring the efficiency of 
decision making units. Eur. J. Oper. Res. 2(6): 429-444. 

Chatterji, A. K., D. I. Levine. 2006. Breaking down the wall of codes: 
Evaluating non-financial performance measurement. Calif. 
Manage. Rev. 48(2): 29-51. 

Chatterji, A. K., D. I. Levine, M. W. Toffel. 2009. How well do social 
ratings actually measure corporate social responsibility? J. Econ. 
Manage. Strategy 18(1): 125-169. 

Chen, C.-M., M. A. Delmas, M. J. Montes. 2010. Eco- vs. productive 
efficiency: A new approach to effective and comparative perfor- 
mance analysis. UC Santa Barbara: Institute for Social, Behavioral, 
and Economic Research. Retrieved from: http://www.escholar 
ship.org/uc/item/36v7p4fj (accessed November 9, 2010). 


Chen, C.-M., J. van Dalen. 2010. Measuring dynamic efficiency: 
Theories and an integrated methodology. Eur. J. Oper. Res. 
203(3): 749-760. 

Chen, J., D. Patten, R. Roberts. 2008. Corporate charitable contribu- 
tions: A corporate social performance or legitimacy strategy? 
J. Bus. Ethics 82(1): 131-144. 

Cho, C., D. Patten, R. Roberts. 2006. Corporate political strategy: An 
examination of the relation between political expenditures, en- 
vironmental performance, and environmental disclosure. J. Bus. 
Ethics 67(2): 139-154. 


Clarkson, M. B. E. 1995. A stakeholder framework for analyzing 
and evaluating corporate social performance. Acad. Manage. 
Rev. 20(1): 92-117. 

Cook, W., J. Zhu. 2006. Rank order data in DEA: A general frame- 
work. Eur. J. Oper. Res. 174(2): 1021-1038. 


Cooper, W., L. Seiford, K. Tone. 2006. Data Envelopment Analysis: 
A Comprehensive Text with Models, Applications, References and 
DEA-Solver Software. Springer Verlag, New york. 


Dawkins, C. 2002. Corporate welfare, corporate citizenship, and the 
question of accountability. Bus. Soc. 41(3): 269-291. 

Deckop, J., K. Merriman, S. Gupta. 2006. The effects of CEO pay struc- 
ture on corporate social performance. J. Manage. 32(3): 329-342. 

Delmas, M., M. Toffel. 2008. Organizational responses to environ- 
mental demands: Opening the black box. Strateg. Manage. J. 
29(10): 1027-1055. 

Delmas, M. A., V. Doctori-Blass. 2010. Corporate environmental 
performance: The trade-offs of sustainability ratings. Bus. Strat- 
egy Environ. 19(4): 245-260. 

Delmas, M. A., I. Montiel. 2009. Greening the supply chain: When is 
customer pressure effective? J. Econ. Manage. Strategy 18(1): 
171-201. 


Delmas, M. A., M. V. Y. Russo, M. J. Montes-Sancho. 2007. Dereg- 
ulation and environmental differentiation in the electric utility 
industry. Strateg. Manage. J. 28(2): 189-209. 

Delmas, M. A., Y. Tokat. 2005. Deregulation, efficiency and gover- 
nance structures: The U.S. electric utility sector. Strateg. Manage. 
J. 26(5): 441-460. 


Delquié, P. 1997. Bi-matching: A new preference assessment method 
to reduce compatibility effects. Manage. Sci. 43(5): 640-658. 


Drumwright, M. E. 1994. Socially responsible organizational buying: 
Environmental concern as a noneconomic buying criterion. 
J. Mark. 58(3): 1-19. 

Dyckhoff, H., K. Allen. 2001. Measuring ecological efficiency with 
data envelopment analysis (DEA). Eur. J. Oper. Res. 132(2): 
312-325. 


Fare, R., S. Grosskopf, C. Pasurka Jr. 2006. Social responsibility: US 
power plants 1985-1998. J. Prod. Anal. 26(3): 259-267. 

Freeman, A. M. 2003. The Measurement of Environmental and Resource 
Values: Theory and Methods. RFF Press, Washington, DC. 

Gond, J.-P., A. Crane. 2010. Corporate social performance disori- 
ented: Saving the lost paradigm? Bus. Soc., [DOI 10.1177/ 
0007650308315510]. 


Goto, M., M. Tsutsui. 1998. Comparison of productive and cost effi- 
ciencies among Japanese and US electric utilities. Omega 26(2): 
177-194. 

Graves, S., S. Waddock. 1994. Institutional owners and corporate 
social performance. Acad. Manage. J. 37(4): 1034-1046. 

Griffin, J. 2000. Corporate social performance: Research directions 
for the 21st century. Bus. Soc. 39(4): 479—491. 


Griffin, J., J. Mahon. 1997. The corporate social performance and 
corporate financial performance debate: Twenty-five years of 
incomparable research. Bus. Soc. 36(1): 5-31. 

Hillman, A., G. Keim. 2001. Shareholder value, stakeholder man- 
agement, and social issues: What’s the bottom line? Strateg. 
Manage. J. 22(2): 125-139. 

Hirsch, P. M., D. Z. Levin. 1999. Umbrella advocates versus validity 
police: A life-cycle model. Organ. Sci. 10(2): 199-212. 

Hull, C., S. Rothenberg. 2008. Firm performance: The interactions of 
corporate social performance with innovation and industry 
differentiation. Strateg. Manage. J. 29(7): 781-789. 


Igalens, J., J.-P. Gond. 2005. Measuring corporate social performance 
in France: A critical and empirical analysis of ARESE data. 
J. Bus. Ethics 56(2): 131-148. 

Johnson, R., D. Greening. 1999. The effects of corporate governance 
and institutional ownership types on corporate social perfor- 
mance. Acad. Manage. J. 42(5): 564-576. 

Kane, G., U. Velury, B. Ruf. 2005. Employee relations and the like- 
lihood of occurrence of corporate financial distress. J. Bus. 
Financ. Account. 32(5-6): 1083-1105. 


Kempf, A., P. Osthoff, A. Platz. 2007. The effect of socially respon- 
sible investing on portfolio performance. Eur. Financ. Manage. 
13(5): 908-922. 

Kennelly, J., E. Lewis. 2002. Degree of internationalization and 
corporate environmental performance: Is there a link. Int. 
J. Manage. 19(3): 478-489. 


Kuosmanen, T., M. Kortelainen. 2007. Valuing environmental factors 
in cost-benefit analysis using data envelopment analysis. Ecol. 
Econ. 62(1): 56-65. 

Landier, A., V. Nair, J. Wulf. 2009. Trade-offs in staying close: Cor- 
porate decision making and geographic dispersion. Rev. Financ. 
Stud. 22(3): 1119-1148. 

Loureiro, M. L., J. Lotade. 2005. Do fair trade and eco-labels in coffee 
wake up the consumer conscience? Ecol. Econ. 53(1): 129-138. 


Luce, R., A. Barber, A. Hillman. 2001. Good deeds and misdeeds: A 
mediated model of the effect of corporate social performance on 
organizational attractiveness. Bus. Soc. 40(4): 397-415. 


Majumdar, S. K., A. A. Marcus. 2001. Rules versus discretion: The 
productivity consequences of flexible regulation. Acad. Manage. 
J. 44(1): 170-179. 

Marquis, C., M. Glynn, G. Davis. 2007. Community isomorph- 
ism and corporate social action. Acad. Manage. Rev. 32(3): 
925-945. 

Mattingly, J., S. Berman. 2006. Measurement of corporate social ac- 
tion: Discovering taxonomy in the Kinder Lydenburg Domini 
ratings data. Bus. Soc. 45(1): 20-46. 

McGuire, J., S. Dow, K. Argheyd. 2003. CEO incentives and corpo- 
rate social performance. J. Bus. Ethics 45(4): 341-359. 
McWilliams, A., D. Siegel. 2000. Corporate social responsibility and 
financial performance: Correlation or misspecification? Strateg. 
Manage. J. 21(5): 603-609. 


Chen and Delmas: Measuring Corporate Social Performance with DEA 


804 Production and Operations Management 20(6), pp. 789-804, © 2010 Production and Operations Management Society 


McWilliams, A., D. Siegel. 2001. Corporate social responsibility: 
A theory of the firm perspective. Acad. Manage. J. 26(1): 
117-127. 

Mitchell, R., B. Agle, D. Wood. 1997. Toward a theory of stakeholder 

identification and salience: Defining the principle of who and 

what really counts. Acad. Manage. Rev. 22(4): 853-886. 

Neiling, E., E. Webb. 2009. Corporate social responsibility and finan- 

cial performance: The “virtuous circle” revisited. Rev. Q. Financ. 

Account. 32(2): 197-209. 

Neubaum, D., S. Zahra. 2006. Institutional ownership and cor- 
porate social performance: The moderating effects of 
investment horizon, activism, and coordination. J. Manage. 
32(1): 108-131. 

Podinovski, V. V., E. Thanassoulis. 2007. Improving discrimination 

in data envelopment analysis: Some practical suggestions. 

J. Prod. Anal. 28(1): 117-126. 


Rehbein, K., S. Waddock, S. B. Graves. 2004. Understanding share- 
holder activism: Which corporations are targeted? Bus. Soc. 
43(3): 239-267. 

Rowley, T., S. Berman. 2000. A brand new brand of corporate social 
performance. Bus. Soc. 39(4): 397-418. 


Ruf, B., K. Muralidhar, R. Brown, J. Janney, K. Paul. 2001. An 
empirical investigation of the relationship between change 
in corporate social performance and financial perfor- 
mance: A stakeholder theory perspective. J. Bus. Ethics 32(2): 
143-156. 

Ruf, B., K. Muralidhar, K. Paul. 1998. The development of a sys- 
tematic, aggregate measure of corporate social performance. 
J. Manage. 24(1): 119-133. 

Seuring, S., M. Müller. 2008. From a literature review to a conceptual 
framework for sustainable supply chain management. J. Clean. 
Prod. 16(15): 1699-1710. 

Shropshire, C., A. Hillman. 2007. A longitudinal study of sign- 
ificant change in stakeholder management. Bus. Soc. 46(1): 
63-87. 

Simar, L., P. W. Wilson. 2007. Estimation and inference in two-stage, 
semi-parametric models of production processes. J. Econ. 136(1): 
31-64. 

Simerly, R. 2003. An empirical examination of the relationship 
between management and corporate social performance. Int. 
J. Manage. 20(3): 353-359. 

Srivastava, S. K. 2007. Green supply-chain management: A state- 
of-the-art literature review. Int. J. Manage. Rev. 9(1): 53-80. 


Strike, V. M., J. Gao, P. Bansal. 2006. Being good while being bad: 
Social responsibility and the international diversification of US 
firms. J. Int. Bus. Stud. 37(6): 850-862. 

Thomas, A., R. Simerly. 1995. Internal determinants of corporate 
social performance: The role of top managers. Academy of 
Management Journal Best Paper Proceedings, Vancouver, 
Canada, pp. 411-415. 


Turban, D., D. Greening. 1996. Corporate social performance and 
organizational attractiveness to prospective employees. Acad. 
Manage. J. 40(3): 658-672. 

Van der Laan, G., H. Van Ees, A. Van Witteloostuijn. 2008. Corporate 
social and financial performance: An extended stakeholder 
theory, and empirical test with accounting measures. J. Bus. 
Ethics 79(3): 299-310. 

Waddock, S. 2003. Myths and realities of social investing. Organ. 
Environ. 16(3): 369-380. 

Waddock, S., S. Graves. 1997a. Quality of management and quality 
of stakeholder relations: Are they synonymous? Bus. Soc. 36(3): 
250-279. 

Waddock, S., S. Graves. 1997b. The corporate social performance- 
financial performance link. Strateg. Manage. J. 18(4): 303-319. 

Waldman, D., D. Siegel, M. Javidan. 2006. Components of CEO 
transformational leadership and corporate social responsibility. 
J. Manage. Stud. 4(8): 1703-1725. 

Webb, E. 2004. An examination of socially responsible firms’ board 
structure. J. Manage. Gov. 8(3): 255-277. 

Wokutch, R. E., E. W. McKinney. 1991. Behavioral and perceptual 
measures of corporate social performance. Post, J. E. ed. 
Research in Corporate Social Performance and Policy 12. JAI Press, 
Greenwich, CT, 309-330. 

Wolfe, R. 1991. The use of content analysis to assess corporate social 
responsibility. Post, J. E. ed. Research in Corporate Social Perfor- 
mance and Policy 12. JAI Press, Greenwich, CT, 281-308. 


Zahra, S. A., B. M. Oviatt, K. Minyard. 1993. Effects of corporate 
ownership and board structure on corporate social responsi- 
bility and financial performance. Academy of Management 
Journal Best Paper Proceedings, Atlanta, GA, pp. 336-340. 

Zhou, P., B. W. Ang, K. L. Poh. 2009. A survey of data envelopment 
analysis in energy and environmental studies. Eur. J. Oper. Res. 
189(1): 1-18. 

Zhu, J. 2003. Imprecise data envelopment analysis (IDEA): A review 
and improvement with and application. Eur. J. Oper. Res. 144(3): 
513-529. 


